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Novel Tricistronic Vectors and Uses Therefor 

This application claims priority to U.S. provisional application serial number 
60/399,150 filed July 30, 2002. The entirety of this application is hereby expressly 
5 incorporated by reference. 

Background of the Invention 

Field of the invention 

The field of the invention relates generally to the expression of recombinant DNA. 
10 More particularly, the invention relates to novel vectors (and uses therefor) that can be used 
to express at least three exogenous genes under the control of a single promoter. 
Background 

A persistent problem associated with expression of multiple, indi vidual recombinant 
polypeptides (i.e. polypeptides that are not fused to each other) via a vector in an expression 

15 system is obtaining satisfactory yields of each polypeptide. This is especially true, for - 
example, when the goal is to express multiple proteins that associate with each other upon 
expression, where poor yield of one or more of the components will hamper or prevent 
association of the expressed proteins. 

The cloning, transformation and expression efficiencies of a vector typically are 

20 inversely related to its size, and therefore one common strategy for expressing multiple 
polypeptides in an expression system is to use multiple vectors instead of "overloading" a 
single vector. This approach has drawbacks, however. For instance, short of employing a 
selection protocol for each vector, there is no way to determine wife certainty that a cell 
contains each vector. In addition, vector incompatibility can hinder obtaining suitable 

25 expression levels even where there is satisfactory vector uptake by the cells. 
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A separate approach is to integrate each exogenous gene into a single construct, but 
under the control of multiple promoters within that construct. This strategy, too, is riddled 
with disadvantages. For example, obtaining suitable expression requires successful function 
of multiple promoters, which can be difficult to achieve. Accordingly, there is no way to 
5 determine with certainty that a cell contains sufficient levels of each recombinant 

polypeptide, short of employing a selection protocol for each gene expression product 
operatively linked to its respective promoter. Furthermore, utilizing one promoter per 
exogenous gene disad vantageously results in a relatively large vector. Placing all cistrons 
into a single vector under the control of a single promoter has not been a viable option in 
10 nearly all applications, since, e.g. , the further a cistron is positioned from its promoter, the 
less likely is the chance that acceptable expression yields will be obtained for that cistron. 

Certain tricistronic vectors are known in the art, however. For example, Burger et aL, 
AppL Microbiol Bioiechnol (1999) 52: 345-353 reported a tricistronic vector that encoded , 
in a 5-prime to 3-prime orientation, (i) a murine light chain Ig, (ii) a murine heavy chain Ig- 
15 TNFa fusion and (iii) puromycin acetyltransferase (>ac) as a selective marker. Burger e/ a/, 
stated that the foregoing tricistronic vector was selected because "expression of the selective 
marker and product are strictly linked" (id. at 351, rt. col.). 

However, in Burger et dl. , the non-Ig polypeptide (i. e. , pac) functioned only as a 
selection vehicle and, hence, did not otherwise associate or otherwise interact with either the 
20 murine light chain Ig or heavy chain Ig-TNFa fusion. Accordingly, Burger et aL provides no 
suggestion that three "structural'* polypeptide domains could be expressed in sufficient yields 
so as to associate or otherwise interact with each other after expression. In other words, the 
disclosure by Burger et aL did not overcome the prejudice in the art against using a 
tricistronic vector to express three or more polypeptide domains that associate or otherwise 
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interact with each other subsequent to expression. It is apparent, therefore, that a vector that 
satisfies these and other drawbacks known in the art is greatly to be desired. The present 
invention provides such vectors, together with methods for their use. 

Summary of the Invention 

Accordingly, it is an object of the invention to provide enhanced expression vehicles 
for generating at least three polypeptide molecules that can interact with each other 
subsequent to expression. 

It is a further object of the invention to provide enhanced expression vehicles that ate 
compatible with a variety of prokaiyotic hosts. 

It is still a further object of the invention to provide methods of using the foregoing 
expression vehicles to discover new and improved therapeutics for treating disease. 

These and other objects are made possible with reference to the teachings contained 

herein. 

In one aspect, the invention provides a tricistronic vector construct that comprises a 
prokaryotic promoter, a first nucleic acid sequence encoding an immunoglobulin-presenting 
polypeptide, a second nucleic acid sequence encoding a first immunoglobulin (Ig) 
polypeptide, a third nucleic acid sequence encoding a second Ig polypeptide; a first 
associating agent fused to or comprised within said Ig-presenting polypeptide, and a second 
associating agent fused to or comprised within said first Ig polypeptide. The first, second and 
third nucleic acid sequences are under the control of said promoter and, upon expression of 
the tricistronic vector, the Ig-presenting polypeptide and the first Ig polypeptide associate via 
their respective associating agents and the first and second Ig polypeptides self-associate. 
The vector may optionally be a phagemid vector. 
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In one embodiment, the Ig-presenting polypeptide may be a phage coat protein, for 
example, a gTO protein or a functional fragment of a gin protein. The gm functional 
fragment may contain an N-terminal domain of gin. 

In another embodiment, the first and second Ig polypeptides self-associate to form a 
5 Fab or other functional Ig fragment, for example via a disulfide bond. The first and/or second 
associating agent may be a cysteine residue. 

In still another embodiment, the first and second Ig polypeptides self-associate Via 
non-covalent interactions. 

In other embodiments, the vector contains (i) a first secretoiy signal sequence in the 
10 * same reading frame as the nucleic acid sequence encoding the first Ig polypeptide, and/or a 
second secretory signal sequence in the same reading frame as the nucleic acid sequence 
encoding the second Ig polypeptide, and/or a third secretory signal sequence in the same 
reading frame as the nucleic acid sequence encoding the Ig-presenting polypeptide. The first, 
second and third secretory signal sequences may be prokaiyotic signal sequences. The vector 
15 may further contain a ribosome binding site positioned 5-primeward of any or all of the 

nucleic acid sequences encoding the second Ig polypeptide, the first Ig polypeptide and/or the 
Ig-presenting polypeptide. 

In still further embodiments, the associating agents become disassociated in solution 
upon the addition of a reducing agent Alternatively, the second associating agent is fused to 
20 said first Ig polypeptide via a peptide linker. 

The following text provides a more detailed, but non-limiting description of the 
present invention^ 
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Brief Description of the Figures 
Figure 1 is a schematic depiction of principal components of an inventive tricistronic 
vector, i.e., a single promoter, an Ig-presenting polypeptide, and two Ig polypeptides. 
Abbreviations: Lac p/o lac promoter operator region; SS gpIH signal sequence, gin phage gene IE; 
5 RBS Ribosomal binding site; ompA outer membrane protein A signal sequence; phoA alkaline 
phosphatase signal sequence; L-His6 PGGSGH6 linker. 

Figure 2A is a vector map of an illustrative vector according to the present invention. 
Figure 2B provides the nucleic acid sequence for the vector described in Figure 2a. 
Figure 3 is a gel that represents a quantitative analysis (by anti-glUp Western blot) of 
10 the mean display rate of Fab on the surfaces of phage . 

Figure 4A is a gel that represents the display rate of a monocistronic scFv vector 
(pMORPHl 3) encoding scFvs from a VL-X pool (conventional display). 

Figure 4B is a gel that represents the display rate of a monocistronic scFv vector 
(pMORPHl 3) encoding scFvs from a VL-k pool (conventional display). - 
15 Figure 4C is a Vector map for pMorphl3 scFv Macl-5 

Figure 4D is the nucleic acid sequence for pMoiphl3 scFv Macl-5 
Figure 5 A is a gel that represents the display rate of a dicistronic scFv vector 
(pMORPH20) encoding scFvs from a VL-X pool (display via Cys residues). 

Figure SB is a gel that represents the display rate of a dicistronic scFv vector 
20 (pMORPH20) encoding scFvs from a VL-k pool (display via Cys residues). 
Figure 5C is a Vector map for pMoiph20 Macl-5 
Figure 5D is the nucleic acid sequence for pMorph20 Macl-5 
Figure 6A is a gel that represents the display rate of a dicistronic Fab vector 
(pMORPHl 8) encoding a Fab of framework combination VH2 X-l ; (conventional display). 
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Figure 6B is a gel that represents the display rate of a dicistronic Fab vector 

(pMORPHl 8) encoding a Fab of framework combination VH3 k-1 ; (conventional display). 
Figure 6C is a Vector map of pMORPH^18-Fab Macl-5 
Figure 6D is the nucleic acid sequence for pMORPH® 1 8-Fab Mac 1-5 
5 Figure 7A is a gel that represents the display rate of a dicistronic Fab vector, using a 

two-vector system (pMORPHXIO & pBR_C_gm) and encoding a Fab of framework 

combination VH3 jc-1, respectively (display via Cys residues). 

Figure 7B is a gel that represents the display rate of a dicistronic Fab vector, using a 

two-vector system (pMORPHXIO & pBR^C^glll) and encoding a Fab of framework 
10 combination VH2k-1, respectively (display via Cys residues). 

Figure 7C is the vector map for pMORPHXIO Fab Mac 1-5 VL LHC VH FS 
Figure 7D is the nucleic acid sequence for pMORPHXIO Fab Macl-5 VL LHC VH 

FS 

Figure 7E is the vector map for pMORPHXIO Fab Macl-5 VL VH LHC 
1 5 Figure 7F is the nucleic acid sequence for pMORPHXIO Fab Macl-5 VL VH LHC 

Figure 7G is the vector map for pBR-C-gjm 

Figure 7H is the nucleic acid sequence for pBR-C-gin 

Figure 8A is a gel that represents the display rate of a tricistronic Fab vector 
(pMORPH23) encoding a Fab pool (framework combinations VH3 jc/X). 
20 Figure 8B is a gel that represents the display rate of a tricistronic Fab vector 

(pMORPH23) encoding a Fab pool (framework combinations VH3 k/X). 

Figure 9 is a bar graph comparing the functionality and the binding efficiency of Fab- 
presenting phage of (i) dicistronic Cys display vectors (2-vector system), (ii) tricistronic Cys 
display vectors, and (iii) dicistronic conventional display vectors in phage ELISA. 

6 



WO 2004/013276 PCT/IB2003/003681 

Detailed Description 
The present invention provides novel tricistronic vectors that are useful in multiple 
contexts. The inventors surprisingly found that tricistronic vectors may constructed to 
express three polypeptide molecules in a suitable yield under the control of a single promoter, 
5 with the additional feature that the expressed polypeptide domains can maintain function and 
interact with each other. Another surprising was the observation that all three polypeptides 
could be exported to the periplasm of a prokaryotic host following expression in the host' s 
cytosol/cytoplasm, and that the expressed polypeptides could interact or otherwise associate 
in the periplasmic space. Vectors according to the present invention are suitable for use in a 
10 number of prokaryotic expression systems. 

A. Components of a Vector of the Invention 
The components of a tricistronic vector of the present invention include: (i) nucleic 
acid sequences encoding three polypeptide molecules (non-fused to each other) and (ii) a 
single promoter that controls expression of all three polypeptides. The polypeptide-encoding 
1 5 nucleic acid sequences encode, for example, (i) an immunoglobulin (Ig)-presenting 

polypeptide domain, (ii) a first Ig domain, and (iii) a second Ig domain. In addition, a vector 
of the invention preferably contains a ribosome binding site S'-ward of each of the foregoing 
polypeptide molecules, which can enhance expression levels. Upon expression, the two Ig 
domains associate to form a functional immunoglobulin fragment, which further associates 
20 with the Ig-presenting domain, thereby permitting, for example, display of the functional 
immunoglobulin fragment on the surface of a filamentous phage. 

A vector of the invention may optionally contain nucleic acid sequences encoding at 
least two associating agents, one of which can be fused to (or comprised within) the Ig- 
presenting polypeptide; and the other of which can be fused to (or comprised within) the first 
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Ig polypeptide or second Ig polypeptide. Preferably, subsequent to expression of the vector, 
the Ig-presenting polypeptide and an Ig polypeptide interact with each other via their 
respective associating agents, and the two Ig polypeptides associate, e.g., by self-association, 
hydrogen bonding, van der Waals forces, or via an associating agent(s). The foregoing 
5 interaction and association interaction and association preferably occur in the periplasm of 
the prokaryotic host however, the invention also contemplates association and interaction in 
the host* s cytosol. 

a. Promoter: 

As used herein, a "promoter" for use in a tricistronic vector of the invention is a 
10 promoter that is capable of driving the expression of (i.e. that is functionally linked to) a 1 
nucleic acid construct that encodes at least three independent polypeptide molecules (e.g. 9 an 
Ig-presenting domain and two Ig domains), where those polypeptides are not expressed as 
fusion proteins with each other. Suitable promoter for use in the invention include, but are 
not limited to, the lac/operon promoter, CMV promoter Pbad, Pus, Para, Padhu Pgal» Pef-Kx, 
15 Psv40,EM-7 promoter, Ptefi, Prsv, PubO 

Prokaryotic promoters of the invention can be either constitutive or, more preferably, 
regulatable inducible or derepressible). Further examples of suitable prokaryotic 
promoters include promoters capable of recognizing the T4 (Malik et al. 9 J. Biol. Chem. 
(1984) 263:1174-1181; Rosenberg etal 9 Gene (1987) 59:191-200; Shinedling et al. 9 J. 
20 Molec. Biol. (1987) 195:471-480; Hu et al 9 Gene (1986) 42:21-30), T3, Sp6, and T7 

(Chamberlin et al. 9 Nature (1970) 228:227-23 1 ; Bailey et al. 9 Proc. Natl. Acad. Set (U.S.A.) 
(1983) 8024:2814-2818; Davanlook et al., Proc. Natl. Acad. ScL (U.SA.) (1984) 81:2035- 
2039) polymerases; the P R and Pl promoters of bacteriophage lambda (THE BACTERIOPHAGE 
Lambda, Hershey, A. D., Ed., Cold Spring Harbor Press, Cold Spring Harbor, N.Y.<1973); 
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Lambda n, Hendrix, R. W., Ed., Cold Spring Harbor Press, Cold Spring Haibor, N.Y. 
(1980)); the tip, recA, heat shock, and lacZ promoters of E. coli\ the ce-amylase (Ulmanen et 
al. 9 J. Bacteriol (1985) 162:176-182) and the E28-specific promoters of A subtilis (Gilman 
et al. 9 Gene (1984) 32:1 1-20); the promoters of the bacteriophages of Bacillus (Gryczan, T. 

5 J., In: The Molecular Biology Of The Bacilli, Academic Press, Inc., NY (1982)); 
Streptomyces promoters (Ward etaL y Mol Gen. Genet. (1986) 203:468-478); the int 
promoter of bacteriophage lambda; the bla promoter of the /3-lactamase gene of pBR322, and 
the CAT promoter of the chloramphenicol acetyl transferase gene of pBR325, etc. 
Prokaryotic promoters are reviewed by Glick, B. R. (J. Indust. Microbiol. (1987) 1:277-282); 

1 0 Cenatiempo, Y. (Biochimie (1 986) 68:505-5 1 6); Watson, J. D., (In: MOLECULAR BIOLOGY OF 
THE Gene, 4th Ed., B enj amin/Cummings Publishing Co., Inc., Menlo Park, Calif (1987)); 
Gottesman, S. (Ann. Rev. Genet. (1984) 18:415-442)). Other prokaryotic promoters that may 
be used include other E. coli promoters (Harley et aL, Nuel. Acid Res. (1987) 15:2343-2361), 
~" and Streptomyces promoters (Strohl, Nucl. Acid Res. (1992) 20:961-974) for use in 

1 5 Streptomyces species expression hosts. All of the foregoing references are incorporated by 
reference. 

b. Immunoglobulin-presenting polypeptide: 

An "immunoglobuUn-presenting" or "Ig-presenting" polypeptide or polypeptide 
domain, as used herein, is a (polypeptide or protein/polypeptide domain that can interact 
20 with at least one immunoglobulin polypeptide, such that the immvuioglobulin(s) are able to 
specifically bind, or are involved in the process of specifically binding, an antigen. An Ig- 
presenting polypeptide preferably interacts with an Ig domain via an associating moiety that 
customarily is fused to (or contained within) the Ig-presenting domain. 
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Suitable Ig-presenting domains include a phage coat (capsid) protein, for example a 
filamentous phage coat protein. A suitable phage coat protein can be, for example, -gene HI 
protein (gJIIp), gene VI protein (gVIp), gene VII protein (gVIIp), gene Vm protein (gVIIIp), 
and gene IX protein (gEXp). A preferred phage coat protein is glllp. A phage coat protein 
5 may be either a wild type or a modified protein. A "wild type phage coat protein" refers to 
any protein forming the phage coat of a naturally occurring bacteriophage. The sequences of 
the foregoing phage coat proteins (including the differences between the closely related 
members of the filamentous bacteriophages such as fl , fd, and M13) are well known to those 
of skill in the art (see, e.g., Kay et al> 1 996). The skilled artisan will recognize that other Ig- 

10 presenting domains are suitable for use in the present invention. * 

An Ig-presenting polypeptide of the invention also may be a truncated or modified 
variant of a phage coat protein (e.g., the C-terminal domain of glllp). In this regard, a 
"truncated" or "modified" variant (or a functional fragment thereof) refers to any phage coat 
protein that has been modified by deleting, inserting and/or substituting at least part of the 

15 wild type sequences. Examples of such variants include truncated gene HI protein variants 
that have been found in bacteriophage mutants (see, for example, Crissman & Smith, 1984) 
or that have been generated for use in phage display methods (e.g. Bass et al, 1990; Krebber, 
1996). 

The invention also contemplates the use of other Ig-presenting polypeptides. An Ig- 
20 presenting polypeptide also may be a green fluorescent protein (gfp), any protein of the cell 
surface or of the cell wall of bacterial cell, or any protein of a bacteriophage or virus coat 
c. Immunoglobulin or "Ig" polypeptide or domain 
An "immunoglobulin" or "Ig" polypeptide or domain hereby is defined as a domain 
of the protein class IgG, IgM, IgE, IgA, and IgD (and any subclass thereof), and includes all 

10 
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conventionally known antibodies and functional fragments thereof. A "functional fragment" 
refers to a fragment of an immunoglobulin which retains the antigen-binding moiety of an 
immunoglobulin. A preferred class of immunoglobulins for use in the present invention is 
IgG. More specifically, an immunoglobulin domain of the invention can include the domain 
5 of (i) a F(aV)2, fragment, or (ii) a Fab fragment The F(ab')2» or Fab may be engineered to 
minimize the intermolecular disulphide interactions that occur between the Chi and Cl 
domains. An Ig polypeptide may have an amino acid sequence derived from that of an 
antibody isolated from nature or derived from a natural source, or may have a sequence that 
is designed in silico and encoded by a nucleic acid that is synthetically created. In silico 

10 design of an antibody sequence can be achieved, for example, by analyzing a database of 
sequences and devising a polypeptide sequence utilizing the data obtained therefrom. 
Methods for designing and obtaining such in si/ico-created sequences are described, for 
example, in U.S. Patent No. 6,300,064 to Knappik et aL 9 which hereby is incorporated by 
reference in its entirety. 

15 A tricistronic vector of the invention preferably encodes two Ig polypeptides that 

interact with each other and form a functional (antigen-binding) molecule. Interaction 
between the two Ig polypeptides typically is mediated by residues that belong to each Ig 
polypeptide. To this end, the first and second polypeptide can comprise heavy and light 
chain regions of an antibody that associate via non-covalent interactions between 

20 corresponding heavy and light chain domains, such as between VH and VL in an Fv 

fragment, or between VH/VL and CH1/CL in a Fab fragment Additionally, heavy and light 
chain regions of an antibody may associate by forming disulphide bonds between the two 
chains, such as is possible in a Fab fragment The present invention specifically contemplates 
the interaction of two Ig polypeptides by mechanisms other than formation of one or more 
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inter-chain disulfide bonds , e.g., via a linker moiety that is non-covalently attached to at least 
one of the Ig domains, via hydrogen bonds, via van der Waals interactions, or via peptidic 
association domains fused to the Ig polypeptides, such as described in U.S. Patent No. 
6,294,353 to Pack et al., which hereby is incorporated by reference. 
5 An "associating agent" for use in the present invention is defined as an agent that can 

bring about the interaction between expressed Ig-presenting and Ig polypeptides. An 
expressed associating agent of the invention is fused to, or comprised within, an Ig-presenting 
polypeptide and a complementary associating agent is fused to, or comprised within, an Ig 
polypeptide. The foregoing associating agents may be two different agents, or may be two 
10 identical or substantially identical agents. An associating agent according to the invention 
preferably contains a cysteine residue that is available for the formation of an intermolecular 
disulphide linkage. 

Preferably, the associating agents are selected so that they do not interfere with the 
desired function of the fully associated protein complex. Typically, therefore, the associating 

1 5 agents are suitable amino acid residues that are located outside the region(s) deemed to be 

responsible for a putative function of the (poly)peptide/protein of interest such as binding to a 
target For example, a cysteine residue that is intended to form an inter-chain disulfide bond 
is positioned at, or in the vicinity of, either the N- or the C-terminus of a polypeptide. 
Other suitable associating agents include those which (i) can be fused to the C- 

20 terminal end of an Ig polypeptide (or within about 1 5 amino acid residues thereof) and (ii) 
can interact with an associating agent fused to or comprised within an Ig-presenting 
polypeptide. Likewise, suitable agents include any which (i) can be fused N-terminally to (or 
comprised within) an Ig-presenting polypeptide (eg., phage coat protein) and (ii) can interact 

12 
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with an associating agent fused to or comprised within an Ig polypeptide. A specific example 
of a pair of associating agents in this regard is an avidin-biotin complex.. 

In the context of the present invention, a cysteine residue is "available for the 
formation of an intermolecular disulfide bond" if the residue is (i) located N-tenninal, C- 
5 terminal* or internal to a polypeptide and (ii) accessible for the formation of a disulfide bond 
with a second residue of the same or different kind. This includes cysteine residues that are 
buried, and thus not accessible in the "final" polypeptide molecule, but which are accessible 
in an intermediate compound formed in the course of expression, processing and/or transport 
in a host cell. 

10 In one embodiment, two associating agents may associate, or attach, by the formation 

of a disulfide bond between (i) at least one cysteine residue present in an Ig polypeptide and 
(ii) a second cysteine residue present within an Ig-presenting domain that is a wild type phage 
coat protein. In the case of filamentous bacteriophage fd, for example, wild type proteins 
contain the following cysteine residues: Cys7, Cys36, Cys46, CysS3, Cysl 88, Cys201 , 

15 Cys354, and Cys371 of protein HI; residue Cys84 of protein VI; residue Cys22 of protein 
VII; residue Cysl 6 of protein DC Any one or more of these residues may act as an 
associating agent 

A tricistronic vector of the invention also may contain one or more ribosome binding 
sites. A ribosomal binding site (Shine-Dalgarno sequence) is a purine rich sequence that in 
20 on bacterial mRNA is located about ten nucleotides 5-primeward of the initiator codon for a 
particular polypeptide. A Shine-Dalgarno sequence is involved in the binding of the 
ribosome and the mediating of efficient translation of the respective-gene. 

A tricistronic vector of the invention also may contain one or more nucleic acid 
sequences that encode a signal or secretory polypeptide. A "signal*' or "secretory" 

13 
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polypeptide hereby is defined as a polypeptide responsible for transporting another 
polypeptide from bacterial cytosol to bacterial periplasm. A signal or secretory polypeptide 
of the invention preferably is located N-terminal to the polypeptide to be transported to the 
periplasm. The use of one or more secretory polypeptides can be especially advantageous in 
5 the context of phage display technology, as described, infra, whereby the secretory 
polypeptide (i) is linked to a encoded polypeptide, and (ii) directs the corresponding 
polypeptide to the periplasmic space of its prokaryotic host cell. Secretory polypeptides 
include, for example, ompA and phoA, gene DI signal sequence, st II, and pelB, each of 
which can be used in a prokaryotic expression system. Other nucleic acid sequences 

10 encoding secretory peptide sequences are well known in the art and may also be used in the 
present invention. In one aspect of the invention, a secretory nucleic acid sequence 
ompA) is linked to the nucleic acid sequence that encodes a first Ig domain, while a second 
secretory nucleic acid sequence (e.g., phoA) is linked to the nucleic acid sequence that 
encodes a second Ig domain. A secretory nucleic acid sequence also can be linked to the 

1 5 nucleic acid sequence that encodes an Ig-presenting polypeptide. Alternatively, the secretory 
domain can be an inherent property of an Ig-presenting domain of the invention. 

A tricistronic vector of the invention also may contain one or more nucleic acid 
sequences that can encode a "polypeptide linker'* that functions to link an associating agent to 
an Ig-presenting and/or an Ig domain. In this context, the linker can be viewed as a "spacer" 

20 between an associating agent and its respective polypeptide. This linker preferably-contains 
about 1-50 amino acids, and preferably 5 to 15 amino acids. Typically, a linker consists of 
glycine-serine rich stretches, but can also contain other amino acid residues. The size can 
also be variable according to the propose of the linker. 
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A tricistronic vector of the invention also can be constructed so as to contain one or 
more affinity tags (e.g., His6 tag) that is fused to one of the Ig domains, for example. An 
affinity tag can be used to purify or isolate a population of Ig molecules bearing this tag. 

A tricistronic vector of the invention also can be constructed so as to contain one or 
5 more restriction sites that facilitate cloning, sub-cloning, or other manipulation of the vector. 
For example, when a plurality of restriction sites are present, unique restriction sites can be 
engineered to flank a particular segment of the vector, thereby making the vector modular. 
The feature of modularity can be advantageous, e.g. , for subsequent modification of the 
tricistronic vector at one or more discrete positions. According to this approach, a particular 

10 segment of the tricistronic vector can be excised and substituted with another desired 
segment, using convention technology. A library such as the HuCAL antibody library 
described in U.S. Patent No. 6,300,064 to Knappik et aL 9 is particularly preferred for use in a 
vector of the present invention. 

An illustrative, non-limiting embodiment of a vector according to the invention 

1 5 (pMORPH23) is set forth in Figure 2A. According to Figure 2 A, pMORPH23 contains a 
ColEI origin of replication, a functional origin for single stranded replication, and a 
chloramphenicol-resistance gene. The tricistronic operon is under the control of an inducible 
lac promoter/operator region. All functional modules are flanked by unique restriction sites. 
The first expression cassette contains the signal sequence of genelH, and the engineered full- 

20 length (mature) genem sequence with an additional N-terminal cysteine residue. The second 
expression cassette, which is preceded by a ribosomal binding site (SD-Seq), encodes the 
light chain of an Ig and contains the bacterial signal sequence ompA followed by VL and CL. 
The third expression cassette, which is preceded by a ribosomal binding site (SD-Seq), 
contains a heavy chain Fd (VH1 + CHI) with an additional C-terminal cysteine. The 

15 
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bacterial signal sequence phoA is followed by VH1 and CHI, whereby a glycine/serine-rich 
linker and a His6-tag act as a spacer for the introduced cysteine to the Fd chain.. 

B. Constructing a Vector of the Invention 

Methods for constructing vectors comprising nucleic acid molecules are known in the 
5 art (see, e.g. 9 Sambrooke* a/., 1989; Ausubel et al> 1994). A vector map of a representative 
vector of the invention (pMORPH23) is provided in Figure 2A, with its nucleic acid sequence 
provided in Figure 2B. 

C. Representative Uses of a Vector According to the Invention 

A tricistronic vector of the invention can be used, or can be modified to be used, in a 
10 variety of prokaryotic expression systems. A suitable host cell is any cell that permits 

expression and subsequent interaction of the three principal polypeptide domains (i.e., Ig- 
presenting and two Ig domains)* Methods for introducing vectors into appropriately chosen 
host cells, and causing or allowing the expression of polypeptides are known in the art (see, 
e.#., Sambrook et a/., 1989; Ausubel et aQ 1994). 
15 A vector according to the invention, is particularly suited for expression in aa E.coli 

host cell. In this regard, the vector can be in the form of a phagemid vector. A phagemid 
consists of elements of conventional plasmid vectors marker gene, cloned genes, 
plasmid origin of replication) and of elements of filamentous phage (e.g., gin, PS and phage 
or*). A phagemid can be introduced into a host cell, and subsequently be cultivated and 
20 amplified therein like a plasmid. Phagemid vectors are well known in the art 

A phagemid does not encode all of the genes necessary to permit assembly of viral 
particles and requires "rescue" in a host cell with a helper phage. The helper phage provides 
the missing phage genes that permit assembly of the viral particles. It will be appreciated that 
use of a phagemid/helper phage system an result in production of particles that contain either 
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the helper phage genome or the phagemid. Methods of preferentially packaging the 
phagemid are well known in the art, for example by using a helper phage, such as Ml 3 K07 
that contains a functional, but defective, DNA origin of replication so that phagemid is 
preferentially packaged into phagemid particles. Methods for the introduction of -genetic 
5 material required to produce progeny phage or phagemid particles in appropriate host cells, 
and for causing or allowing the generation of such particles are well known in the art (see, 
e.g., Kay et al, eds. (1996) Phage display of peptides and proteins: a laboratory 
manual. Academic Press, Inc., San Diego). 

A vector of the invention can, accordingly, be use to carry out a method for producing 

10 a polypeptide or protein having a desired property. This method includes the steps of (i) 
providing a collection of bacteriophage particles that present on their surface a diverse 
collection of one or more Ig polypeptides as defined herein; and (ii) -screening and/or 
selecting the diverse collection for at least one Ig domain having the desired property. Here, 
the term "desired property" refers to a property that (a) one of the polypeptides or proteins out 

15 of the diverse collection should have and (b) forms the basis for screening and/or selecting 
the diverse collection. A property might be the ability to: bind a target, block a target, or 
activate a target-mediated reaction. A further property may be, for example, enzymatic 
activity, or any other properties known to those skilled in the art Methods for identifying 
suitable experimental formats and for carrying out necessary steps for performing screening 

20 and/or selection are well known in the art 

A preferred property of an Ig polypeptide is specific binding to a target The target 
can be presented to the diverse collection of bacteriophage particles in a variety of ways well 
known to one of ordinary skill, for example, by coating on surfaces for solid phase 
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biopanning, by linkage to particles such as magnetic beads for biopanning in solution, or by 
display on the surface of cells for whole cell biopanning. 

Bacteriophage particles that display (via an Ig-presenting domain) one or more Ig 
polypeptides (which are bound to a target) can be recovered by a variety of methods well 
5 known to one of ordinary skill. If the associating agents link the Ig-presenting polypeptide 
and Ig polypeptide via a disulfide bond, then the specifically bound Ig-target complexes can 
be treated under reducing conditions (e.g. 9 incubation with DTT) to cleave the disulfide bonds 
and to recover the specific bacteriophage particles for further rounds of biopanning and/or for 
identification of the Ig polypeptide domains specifically binding to said target 
10 Examples 

The present invention can be better understood with reference to the following 

examples, which are not intended to limit the scope of the invention as described above. 

Example 1: General protocol for quantitative analysis of display of antibody 
fragments on phage 

15 The protocol, which also applies to Examples 2-6, was performed according to 

Johansen, L. K. et al (1995), Protein Engiig. 8, 10, 1063-1067. Different dilutions of the 
same phage preparation were subjected to a protein gel. However, in Examples 3, S and 7, no 
reducing agents were added, due to the presence of cysteines as associating agents. The 
proteins of the protein gel were transferred to a membrane. gTTTp protein on the membrane 

20 was detected by anti-gHIp antibody (Western blot). Then, one wild-type gene HI protein 

( 4C wtgIIIp") band and one band of the antibody-glllp linkage, which have the same intensity 

were analysed. Given (i) the number of phages loaded, (ii) the molar ratio of both proteins, 

and (iii) the assumption of 5 wtgJIIp-proteins per phage, the mean number of antibody 

fragments displayed per phage could be calculated. Figure 1 provides expression data of a 

25 dicistronic Fab vector (pMORPH18) using conventional (i.e., glllp-fusion) display. The data 
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indicate a ratio of Fd-gIIIct:wtgHIp = 1 .25x10* phage:2xl0 7 phage. This correlates to the 
presentation of 1 Fd-glUct per 6*25 wtglllp; in other words, 1 Fd-glllctper 1.25 wtgjDDEp. 
Accordingly, the mean number of Fabs per phage in this experiment was 0.8. 

Abbreviations throughout Fd = VH-CH1; Fd-ct = Fd-gTOct & VH-GHl-gHIct; g3p = 

5 glHp. 

Example 2: Display determination of monocistronic scFv vector, using conventional 
display 

A protocol as disclosed in Example 1 was carried out for performing a quantitative 
display analysis of a monocistronic scFv vector (pMORPH13), using conventional phage display. 

1 0 Figure 4 A provides expression data of the (pMORPH 1 3) vector from a VL-X pool; and 

Figure 4B provides expression data of the (pMORPH13) vector from a VL-k pool. The data 
indicate a ratio of scFv-gIIIct:wtgIHp = 1x1 0 9 phage:6.7xl0 7 phage in Figure 4A, and a ratio 
of scFv-gIIIct:wtginp = 5x1 0 8 phage: 1x1 0 7 phage in Figure 4B. Accordingly, the mean 
number of ScFv per phage in this experiment was approximately 0.3 and 0.1 ^respectively. 

15 Example 3: Display determination of a dicistronic scFv vector, using Cys display 
A protocol as disclosed in Example 1 was carried out (except with using reducing 
agents) for performing a quantitative display analysis of a dicistronic scFv vector 
(pMORPH20), using Cys display. Figure 5A provides expression data of the pMORPH20 
vector from a VL-X pool; and Figure 5B provides expression data of the pMORPH20 vector 

20 from a VL-k pool. The data indicate a ratio of scFv-SS-gmct:wtgIIIp = Ixl0 10 phage:4xl0 7 
phage in Figure 5 A, and a ratio of scFv-SS-gIIIct:wtgIIIp = 5xl0 9 ***phage:2xl0 7 phage in 
Figure 5B. Accordingly, the mean number of scFv per phage in this experiment was 
approximately 0.02 and 0.02 respectively. 
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Example 4: Display determination of a dicistronic Fab vector, using conventional 
display 

A protocol as disclosed in Example 1 was carried out for performing a quantitative 
display analysis of a dicistronic Fab vector (plS40RPH18), using conventional phage display. 

5 Figure 6A provides expression data of the pMORPHlS vector (single Fab of framework 
combination VH2-X1); and Figure 6B provides expression data of the pMORPHlS vector 
(single Fab of framework combination VH3-K1). The data indicate a ratio of Fd- 
gmct:wtglllp == I.xl0 9 phage:2xl0 7 phage in Figure 6A, and a ratio of Fd-glllct: wtglllp = 
lxl 0 8 phagerlxlO 7 phage in Figure 6B. Accordingly, the mean number of Fabs per phage in 

10 this experiment was approximately 0.1 and 0.5, respectively. 

Example 5: Display determination of a dicistronic Fab two-vector system, using Cys 
display 

A protocol as disclosed in Example 1 was carried out (except with using reducing 
agents) for performing a quantitative display analysis of a dicistronic Fab vector in a two-vector 

15 system (pMORPHIO + pBR_C_gHI), using Cys display. Figure 7A provides expression data of 
the pMORPHIO vector system (single Fab of framework combination VH3-K1); and Figure 
7B provides expression data of the pMORPHIO vector system (single Fab of framework 
combination VH2-X1). The data indicate a ratio of VL_CL-SS-gm:wtgIIIp = lxl 0 9 
phage:8xl0 6 phage in Figure 7A, and a ratio of VL_CL-SS-gin:wtgnip = 8xl0 9 phage:3xl0 7 

20 phage in Figure 7B. Accordingly, the mean number of Fabs per phage in this experiment was 
approximately 0.04 and 0.02, respectively. 

Example 6: Analysis of display rates and efficiency in phage ELISAs 

The following table summarizes approximate display rates obtained in Examples 1-5: 
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Table I 



System 


Vectors) 


Approximate display 
rates fig per Phage) 


scFv conventional (monocistronic) 


pMORPH13 


0.1-0.3 


scFv CysDisplay (dicistronic) 


DMORPH20 


0.02 


Fab conventional (dicistronic) 


PMORPH18 


0.1-0.8 


Fab CysDisplay (dicistronic) 


pMORPHXIO + pBR C gffl 


0.02 - 0.04 



From this table, two trends are understood. First, the display rates decrease as much 
as 2.5 to 40 fold when using CysDisplay in lieu of conventional display. Second, the display 

5 rates decrease as much as 5 to 1 5 fold when moving from a monocistronic conventional 

display vector to a dicistronic CysDisplay vector. Accordingly, CysDisplay phage generally 
showed reduced display rates in comparison to phage containing conventional genetic fusions 
of antibody fragments to gin (or glllct fragment). Because it would be highly undesirable to 
work with display rates lower than 0.04 Fabs per phage, the use of a tricistronic Fab vector 

10 that additionally was engineered for CysDisplay was thought not to be possible, based on the 
foregoing trends of decreased display rates. 

Example 7: Display determination of tricistronic Fab vector, using Cys display (single 
vector system) 

A protocol as disclosed in Example 1 was carried out (except with using reducing 
15 agents) for performing a quantitative display analysis of a tricistronic Fab vector (pMORPH23), 
using Cys display. Figure 8A provides expression data of the pMORPH23 vector from a VH3 
+ k/X pool; and Figure 8B provides expression data of the pMORPH23 vector from a VH3 + 
k/X pool. The data indicate a ratio of Fd-SS-gm:wtgmp ^5xl0 l0 phage:3xl0 9 phage in 
Figure 8A, and a ratio of Fd-SS-gm:wtgIIIp = 5x1 0 9 phage: 1x1 0 8 phage in Figure SB. 
20 Accordingly, the mean number of Fabs per phage in this experiment was approximately 0.3 
and 0.1, respectively. 
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As is shown in Table II below, the tricistronic Fab CysDisplay vector (pMORPH23) 
yield improved Fab display rates when compared to the dicistronic Fab CysDisplay system, 
which always needs a second vector providing the Cys-gin construct When using a constant 
amount of phage in the foregoing examples, the signals obtained with the tricistronic system 
5 were higher than those obtained with the dicistronic system. This indicates an increased 
display rate with the tricistronic version, which was unexpected. 



Table n 



System 


Vectors) 


Approximate display 
rates (Ig per Phage) 


scFv conventional 


PMORPH13 


0.1-03 


scFv CysDisplay 


PMORPH20 


0.02 I 


Fab conventional 


PMORPH18 


0.1-0.8 


Fab CysDisplay dicistronic 


pMORPHXIO + pBR C gin 


0.02-0.04 


Fab CysDisplay tricistronic 


PMORPH23 


0.05-03 



Example 8: Comparison of dicistronic and tricistronic Fab Cys Display vectors in 
10 phage ELISA 

Phage preparations (i) anti-Mac 1 I-domain, (ii) Fab Macl-5 and (iii) Macl_A8 were 
expressed from the dicistronic CysDisplay vector (pMORPHXIO + pBR_C jgffl; two-vector- 
system), the tricistronic CysDisplay vector (pMORPH23) and the dicistronic, conventional 
Fab vector (pMORPH18) and displayed on phage. The phage were concentrated and the titer 

15 of the phage preparations was determined. 

Maxisorp wells of a microtiter plate were coated with 100 jxl Macl I-domain protein 
per well (concentration of the antigen solution in PBS: 50 ng/ml) overnight at 4°C. The 
antigen solution was removed and the coated wells were washed with PBS. Next, the 
antigen-coated wells were blocked with 300 fxl 5% MPBST for 1 hour at room temperature. 

20 At the same time, an aliquot of each phage preparation (1 00 \il per well; 7.5E+9 phages) 1:1 
was mixed with 10% MPBST (incl. 0.1% Tween20). The phage were incubated for 1 hour at 
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room temperature. The coated wells were washed 3x with PBS. Then, 200 \i\ of pre-blocked 
phage solution was transferred into each coated well, and incubated for 1 hour at room 
temperature. Then, the phage were removed from the wells, and non-bound phage were 
washed off using PBST and PBS. Next, 100 fd anti-M13-HRP conjugate (1:5000) in 5% 

5 MPBST (inch 0.05% Tween20) was added and incubated for one hour at room temperature. 
Another PBST and PBS wash was performed, and 100 fil POD-Substrate was added. A 
measurement at 370 nm was taken after several minutes in order to quantify the amount of 
anti-Mac 1 phage attached to the antigen in the wells. 

Figure 9 is a bar graph that compares functionality and binding efficiency (functional 

10 Fab display) between dicistronic and tricistronic Fab Cys Display vectors in phage EUSA. 
The improved binding efficiency of the phage resulting from the tricistronic CysDisplay 
vector versus the dicistronic CysDisplay vector confirms the data of the increased display 
rates. Bars 1 and 2 represent independent experiments of the same construct The first two 
bars for each group represent experiments performed with Fab molecule Macl-5; the last two 

1 5 bars for each group represent experiments performed with Fab molecule Macl_A8). 

Example 9: Successful Antibody Library Screening using a tricistronic vector system 
in Cys Display 

Wells of MaxiSorp™ microtiter plates (NUNC) were coated with 15 fig per antigen 
(ICAM-1 protein, rabbit myosin, FTTC-BSA, estradiol-BSA) dissolved in PBS, Using a 
20 tricistrcmic vector as described in Examples 2A and 2B in conjunction with proprietary 
MorphoSys phage display and selection techniques, the results provided in Table HI were 
obtained upon screening a MorphoSys HuCAL® Library. 
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Table III 



Antigen 


Elution 


% primary hits 
2 nd round ! 


% primary hits 
3 rt round 


No. of consolidated, 
specific antibodies 


ICAM-1 protein 


DTT 


0% 


17% 


1 


ICAM-1 protein 


qlydne + TGI 


0% 


60% 


3 


myosin 


DTT 


1% 


29% 


4 


myosin 


qlycine + TGI 


' 14% 


19% 


1 


FTTC-BSA 


DTT 


82% 


100% 


6 


FTTC-BSA 


qlydne + TGI 


92% 


87% 


6 


estradiol-BSA 


DTT 


75% 1 


67% 


6 


estradiol-BSA 


glycine + TGI 


59% 


67% 


3 



The foregoing data confirm that tricistronic vectors of the invention are effective 
5 vehicles for expressing, at a minimum, three functional polypeptide molecules. 



Example 10: Construction of pMOKPH23 vector 

The vector pMORPH23 described here is a derivative of the pCAL vector series 
(WO 97/08320; Knappik et aL 9 2000), which is a modified version of the dicistronic 

10 expression vector pMORPH20 (example 3). A vector map for pMORPH20 is provided in 
Figure 5C and the related nucleic acid sequence is provided in Figure 5D. 

The dicistronic expression vector pMORPH20 was digested with restriction enzymes 
StuJ and Msc/, to remove the scFv-expression module. The resulting blunt end cut vector 
was religated after agarose gel purification and transformed into competent E.coli<^lls. The 

15 intermediate vector product was further modified by replacing the ompA signal sequence 
(Xba/ and Ecoi? V digest) by a oligonucleotide cassette preformed by annealing primer pairs 
A and B coding for the gpIQ signal sequence and introducing a 5' AccI restriction site and a 
3* blunt end. 
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Primer A: 

ctagtatacg agggcaaaaa atgaaaaaac 
gattccgctg gtggtgccgt tctatagcca 

Primer B: 

5 gtcgcagtag tcgctatggc tatagaacgg 

ggaatcgcga acagcagttt tttcattttt 

To obtain the final pMORPH23 library cloning vector, an Afffl-Xbal-Bla-EcoRI 
staffer cassette was introduced by ligation. The stuffer fragment allows eflBcient cloning of 
10 HuCAL Fab fragments by Xbal and EcoRI. An example for tricistronic pMORPHK3- 
HuCALFab vector is shown in Figure 2A and 2B. All three modules in pMORPH23 are 
transcribed as one unit from the lac p/o region. 



tgctgttcgc 
tagcgactac tgcgac 

caccaccagc 
tgccctcgta ta 
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